Session 4: Natural Language
نویسنده
چکیده
Collectively, the papers in this session are mainly concerned with parsing, semantic interpretation, and inference. Interest in these processes can be motivated if we recognize that the overall goal of the field of NLP is the manipulation of natural language in ways that depend on meaning. Parsing, the recovery of the linguistic or grammatical structure of a natural-language utterance, is of concern to NLP because, in general, the meaning of a natural-language utterance depends on its structure. An example that illustrates this point is the sentence The man with the umbrella opened the door. If we tried to process this sentence without paying attention to its grammatical structure, we could easily be misled by the fact that it contains the substring ~he umbrella opened. But this sentence has nothing to do with umbrellas opening. Because of the structure of the sentence, the umbrella must be grouped with with and not with opened. If the central concern of NLP is the manipulation of natural language in ways that depend on meaning, then semantic interpretation would naturally be expected to play a central role. In practice, semantic interpretation in NLP usually means recovering a representation of the meaning of an utterance that encodes that meaning in a more transparent way than does the utterance itself. IIow does this contribute to the goal of manipulating language in meaning-dependent ways? We want to have algorithms that manipulate language according to meaning, but meaning is ultimately an abstraction that algorithms can have no direct access to. Algorithms can directly manipulate expressions only according to their structure. Thus we need expressions whose structure corresponds in a very direct way to their meaning. While, as we have argued above, the meaning of a natural-language expression is dependent on its structure , this dependence can be very indirect. By recovering an expression that encodes the meaning of an utterance more directly, we can create modular algorithms that consist of interacting pieces that each look only at a small piece of the structure of the meaning representation. If the pieces of the meaning representation fit together in a natural way that reflects the overall meaning of the utterance , then the algorithms that manipulate them will also be able to fit together in a natural way that reflects the overall meaning of the utterance. Finally, inference is the pay-off for the previous phases of parsing and semantic interpretation, being …
منابع مشابه
Session 4: Statistical Language Modeling
Corpus based Natural Language Processing (NLP) is now a well established paradigm in NLP. The availability of large corpora, often annotated in various way has led to the development of a variety of approaches to statistical language modeling. The papers in this session represent many of these important approaches. I will try to classify these papers along different dimensions, thus providing t...
متن کاملSession 11 - Natural Language III
The five papers in this session, as well as the ten papers in the other two natural language sessions, can be classified into three broad categories: (1) statistical approaches to natural language processing and the automatic acquisition of linguistic structure (2 out of 5 papers in this session; 8 out of 15 overall); (2) robust processing of texts by combining multiple partial analyses (2 out ...
متن کاملSpoken Language Systems II
The papers in this session were concerned with higher-level processing in speech recognition systems and, in some cases, the interface between the speech-recognition and natural-language components of a spoken language system. The session consisted of talks from four DARPA sites, Dragon Systems, SRI International, BBN Systems and Technologies, and MIT Lincoln Laboratory. These talks were follow...
متن کاملSession 5: Natural Language, Discourse
The papers in this group cover a broad range of topics from different perspectives. They have in common an emphasis on the handling of frequently-occurring phenomena in real data sets of spoken and written language, phenomena which are in some sense outside of the scope of some of the core problems in human language technologies. We can view problems such as acoustic processing, word recognitio...
متن کاملSession 2: Language Modeling
This session presented four interesting papers on statistical language modeling aimed for improved large-vocabulary speech recognition. The basic problem in language modeling is to derive accurate underlying representations from a large amount of training data, which shares the same fundamental problem as acoustic modeling. As demonstrated in this session, many techniques used for acoustic mode...
متن کاملSession 9: Automatic Acquisition of Linguistic Structure
This special session was devoted to a rapidly expanding focus of research in natural language processing. Five years ago, only two or three pioneering researchers were attempting to automatically extract and utilize higher level linguistic structure from large corpora; all of this work was within the stochastic modeling tradition. The seven papers presented within this session provide evidence ...
متن کامل